Marker-assisted Selection in 
Backcross Breeding 

S.J. Openshaw

Pioneer Hi-Bred Intl. Inc., P.O. Box 1004, Johnston, IA 50131
S.G. Jarboe¹

CIMMYT, Lisboa 27, Apdo. Postal 6-641, 06600 Mexico, D.F., Mexico
W.D. Beavis

Pioneer Hi-Bred Intl. Inc., P.O. Box 1004, Johnston, IA 50131

Abstract. The backcross breeding procedure has been used widely to transfer simply inherited traits into elite genotypes.
Genetic markers can increase the effectiveness of backcrossing by 1) increasing the probability of obtaining a suitable
conversion, and 2) decreasing the time required to achieve an acceptable recovery. Simulation and field results indicated
that, for a genome consisting of ten 200-cM chromosomes, basing selection on 40 or 80 markers in 50 BC individuals that
carry the allele being transferred can reduce the number of backcross generations needed from about seven to three.



The backcross breeding procedure has been used widely
to transfer simply inherited traits into elite genotypes.
Usually, the trait being transferred is controlled by a
single gene, but highly heritable traits that are more complexly
inherited have also been transferred successfully by
backcrossing; for example, maturity in maize (Rinke and Sentz, 1961;
Shaver, 1976). Today, backcrossing is being used to transfer
genes introduced by such techniques as transformation or
mutation into appropriate germplasm.

Several plant breeding textbooks give good descriptions of
the backcross procedure (Allard, 1960; Fehr, 1987). A donor
parent (DP) carrying a trait of interest is crossed to the recurrent
parent (RP), an elite line that is lacking the trait. The F1 is
crossed back to the RP to produce the BC1 generation. In the
BC1 and subsequent backcross generations, selected individuals
carrying the gene being transferred are backcrossed to the
RP. The expected proportion of DP genome is reduced by half
with each generation of backcrossing. Ignoring effects of
linkage to the selected DP allele being transferred, the percentage
recurrent parent (%RP) genome expected in each backcross
generation is calculated as:

%RP = 100 [1 - (0.5)^{n+1}]

where n is the number of backcrosses.

Backcrossing of selected plants to the RP can be repeated
each cycle until a line is obtained that is essentially a version of
the RP that includes the introgressed allele. After six
backcrosses, the expected recovery is >99% (Table 1).
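
As a quick check of the expected-recovery expression, %RP = 100 [1 - (0.5)^{n+1}], the short Python sketch below evaluates it for n = 0 (the F1) through n = 7. The function name expected_rp_percent is ours and does not come from the original paper; the printed values should agree with Table 1.

```python
def expected_rp_percent(n_backcrosses: int) -> float:
    """Expected %RP after n backcrosses, ignoring linkage to the selected allele."""
    if n_backcrosses < 0:
        raise ValueError("number of backcrosses must be non-negative")
    # Each backcross halves the expected donor share, which starts at 50% in the F1.
    return 100.0 * (1.0 - 0.5 ** (n_backcrosses + 1))


if __name__ == "__main__":
    for n in range(8):
        label = "F1" if n == 0 else f"BC{n}"
        print(f"{label:<4} {expected_rp_percent(n):.4f}")
```

With n = 6 the function returns 99.21875, the first generation in which the expectation exceeds 99%.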

Until recently, discussions of the recovery of the RP genome
during backcrossing have emphasized the expected values for
%RP shown in Table 1, and have largely ignored the genetic
variation for %RP that exists around the expected mean. With
the development of genetic markers capable of providing good
genome coverage, there has been interest in taking advantage of
that variation to increase the efficiency of backcrossing.

¹Formerly with Purdue University, West Lafayette, Ind.

Analysis of Molecular Marker Data

Selection for RP marker alleles can greatly increase the
effectiveness of backcross programs by allowing the breeder to
1) select backcross plants that have a higher proportion of RP
genome, and 2) select backcross individuals that are better
conversions near a mapped donor allele being transferred (i.e.,
select for less linkage drag). Expressed in practical terms, using
genetic markers to assist backcrossing can 1) increase the
probability of obtaining a suitable conversion, and 2) decrease
the time required to achieve an acceptable recovery.

Issues to consider when planning a marker-assisted
backcross program include 1) the time advantage of using markers
to assist backcrossing, 2) the number of markers needed, and 3)
the number of genotypes to evaluate. In this report, we use
results from previous literature, computer simulation, and
empirical studies to provide some guidelines.



Table 1. Expected recovery of recurrent parent (RP) genome during
backcrossing, assuming no linkage to the gene being transferred.

Generation      % RP

F1              50.0000
BC1             75.0000
BC2             87.5000
BC3             93.7500
BC4             96.8750
BC5             98.4375
BC6             99.2188
BC7             99.6094






Materials and Methods

The maize genome was the model for the simulation. The
simulated genome contained ten 200-cM chromosomes.
Simulation of crossing over was based on a Poisson distribution with
a mean of 2.0 (λ = 2) (Hanson, 1959), which, on average,
generated one crossover for every 100-cM length. The
simulations reported here assume no interference. Codominant
genetic markers were evenly distributed in the genome, and sites
of the donor gene were randomly assigned to genome locations.

Simulations were conducted with the following parameters:

Number of progeny: 100 or 500.

Backcross generations: BC1, BC2, and BC3.

Number of markers: 20, 40, 80, or 100.

Number selected to form the next BC generation: 1 or 5.

Selection was based on 1) presence of the donor allele and 2)
high %RP. %RP was calculated as the average of the (one or
five) selected individuals. Values presented are the mean of 50
simulations.
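
The procedure described above can be sketched in a few lines of Python. This is a simplified illustration under stated assumptions, not the program used for the results reported here: chromosomes are discretized into 1-cM bins, only one individual is selected per generation, markers are placed at evenly spaced bins (the marker count is assumed to be a multiple of ten), and a single run is returned rather than the mean of 50 simulations. All function and variable names (gamete, pct_rp, run) are hypothetical.

```python
import random

N_CHROM, CHROM_CM = 10, 200   # genome from the text: ten 200-cM chromosomes


def gamete(hap, rng):
    """One gamete from a parent carrying the mosaic homologs `hap` (one list
    of 0/1 flags per chromosome, one flag per 1-cM bin, 1 = donor) opposite
    pure-RP homologs. Crossover spacing is exponential with mean 100 cM,
    i.e. a Poisson process with no interference."""
    out = []
    for chrom in hap:
        on_mosaic = rng.random() < 0.5          # homolog the gamete starts on
        next_xo = rng.expovariate(1 / 100.0)    # position (cM) of first crossover
        new = []
        for pos in range(CHROM_CM):
            while next_xo <= pos:               # pass each crossover before this bin
                on_mosaic = not on_mosaic
                next_xo += rng.expovariate(1 / 100.0)
            new.append(chrom[pos] if on_mosaic else 0)
        out.append(new)
    return out


def pct_rp(hap, positions=None):
    """%RP of a backcross individual: donor bins are heterozygous (half RP).
    With `positions`, only those marker bins on each chromosome are scored."""
    if positions is None:
        flags = [a for chrom in hap for a in chrom]
    else:
        flags = [chrom[p] for chrom in hap for p in positions]
    return 100.0 * (1.0 - 0.5 * sum(flags) / len(flags))


def run(n_progeny=100, n_markers=40, n_gen=3, seed=0):
    """Return the true %RP of the selected individual in BC1..BCn."""
    rng = random.Random(seed)
    per_chrom = n_markers // N_CHROM
    markers = [int((i + 0.5) * CHROM_CM / per_chrom) for i in range(per_chrom)]
    gene_chrom, gene_pos = rng.randrange(N_CHROM), rng.randrange(CHROM_CM)
    parent = [[1] * CHROM_CM for _ in range(N_CHROM)]   # F1: donor homologs all DP
    history = []
    for _ in range(n_gen):
        progeny = [gamete(parent, rng) for _ in range(n_progeny)]
        # Criterion 1: keep only progeny carrying the donor allele.
        carriers = [h for h in progeny if h[gene_chrom][gene_pos] == 1]
        if not carriers:
            raise RuntimeError("no progeny carried the donor allele")
        # Criterion 2: highest marker-based %RP among the carriers.
        parent = max(carriers, key=lambda h: pct_rp(h, markers))
        history.append(pct_rp(parent))
    return history


if __name__ == "__main__":
    print(run(n_progeny=100, n_markers=80, seed=1))
```

Because selection ranks individuals on the marker bins only, while the returned values are computed over every bin, the same sketch can be used to compare marker-based estimates with the true genome composition.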

Results 

In the computer simulation study, all methods modeled
greatly increased the speed of recovering the RP genome
compared to the expected recovery with no marker-assisted
selection (compare Tables 1 and 2). At least 80 markers were
required to recover 99% of the RP genome in just three BC
generations (Table 2). Use of at least 80 markers and 500
progeny allowed recovery of 98% RP in just two BC
generations. Response to selection was diminished only slightly by
spreading the effort over five selections. Using markers, the
number of backcross generations needed to convert an inbred is
reduced from about seven to three.

By the BC3 generation, there appears to be no practical
advantage to using 500 vs. 100 individuals. If the presence of
the donor trait in the backcross individuals can be ascertained
before markers are genotyped, then only half the number of
individuals indicated in the tables will need to be analyzed.

When a small number of markers are used, they quickly
become non-informative; i.e., selection causes the marker loci
to become fixed for the RP type before the rest of the genome
is fully converted (Table 3; Hospital et al., 1992). This situation
was most prominent in the larger populations, where a higher
selection intensity placed more selection pressure upon the
marker loci. Accordingly, it is of interest to consider how
closely the estimation of %RP based on markers reflects the
actual genome composition. The combination of estimation of
%RP based on fewer markers and subsequent selection tends to
bias the estimates upward (compare Tables 2 and 3).
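
The upward bias can be illustrated with a small hypothetical case. In the Python sketch below, a single 200-cM chromosome still carries a 40-cM donor segment, but the two markers scored on it (at 50 and 150 cM, positions chosen only for illustration) both fall outside that segment. The function name and the example chromosome are our assumptions, not data from the study.

```python
def pct_rp_from_flags(donor_flags):
    """%RP of a backcross individual from 0/1 flags (1 = locus still
    heterozygous for the donor allele, so only half RP at that locus)."""
    return 100.0 * (1.0 - 0.5 * sum(donor_flags) / len(donor_flags))


# Hypothetical chromosome in 1-cM bins with a donor segment from 80 to 120 cM.
chromosome = [0] * 200
for cm in range(80, 120):
    chromosome[cm] = 1

marker_positions = [50, 150]   # both markers miss the donor segment

estimated = pct_rp_from_flags([chromosome[p] for p in marker_positions])
actual = pct_rp_from_flags(chromosome)

print(f"marker estimate: {estimated:.1f}  actual: {actual:.1f}")
# prints "marker estimate: 100.0  actual: 90.0"
```

Once both markers are fixed for the RP type they can no longer discriminate among individuals, even though a tenth of this chromosome remains unconverted.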

The results from the simulation compare well with real field
data. In a typical example, 50 BC1 plants carrying the gene being
transferred were genotyped at 83 polymorphic RFLP loci (note
that this corresponds to a population size of 100 unselected
plants in Tables 2 and 3). The five best BC1 recoveries had
estimated %RP values of 85.9%, 82.7%, 82.0%, 81.4%, and
81.2%. After evaluating 10 BC2 plants from each selected BC1,
the best BC2 recovery had an estimated %RP of 94.6%.

Discussion 

The simulations (Table 2; Hospital et al., 1992) and our
experience indicate that four markers per 200-cM chromosome
are adequate to greatly increase the effectiveness of selection in
the BC1. However, using only four markers per 200 cM will
likely make it very difficult to map the location of the gene of
interest. Adequate summarization of the data is an important


Table 2. Percent recurrent parent genome during marker-assisted backcrossing.

                      100 Progeny                    500 Progeny
                      No. markers                    No. markers
Generation      20     40     80    100        20     40     80    100

One selected
BC1           84.5   84.5   84.2   88.0      89.9   90.7   90.2   90.5
BC2           95.0   95.2   95.8   97.2      96.5   97.7   98.5   98.6
BC3           97.4   97.6   98.9   99.2      97.7   98.3   99.4   99.5

Five selected
BC1           82.9   85.1   84.9     --      87.7   88.1   88.9   88.9
BC2           93.7   95.0   95.8   95.7      95.5   96.8   97.8   97.9
BC3           97.1   98.3   98.8   98.9      97.3   98.5   99.3   99.3

-- Value not legible in the source copy.



Table 3. Estimates of percent recurrent parent genome, based on marker loci.

                      100 Progeny                    500 Progeny
                      No. markers                    No. markers
Generation      20     40     80    100        20     40     80    100

One selected
BC2           98.7   97.8   95.6   97.2     100.0   99.1   98.6   98.0
BC3          100.0   99.6   99.3   99.5     100.0  100.0   99.9   98.2

Five selected
BC2           96.4   96.5   96.2   95.8     100.0   98.5   98.3   98.2
BC3           99.9   99.8   99.3   99.1     100.0  100.0   99.9   99.8






part of a marker-assisted backcross program. Ideally, the markers
used can supply data that can be represented as alleles of loci
with known map position. Estimation of %RP, mapping the
position of the locus of interest, and graphical display of the
results (Young and Tanksley, 1989) are all useful in
understanding and controlling the specific backcross experiment
being conducted.

It appears that, with the use of genetic markers, the portion
of the RP genome that is not linked to the allele being
transferred can be recovered quickly and with confidence. The
recovery of RP will be slower on the chromosome carrying the
gene of interest. A considerable amount of linkage drag is
expected to accompany selection for the DP allele in a
backcross program. For a locus located in the middle of a 200-cM
chromosome, the length of the DP chromosome segment
accompanying selection is expected to be 126, 63, and 28 cM in
the BC1, BC3, and BC7 generations, respectively (Hanson,
1959; Naveira and Barbadilla, 1992). Our observations support
the recommendation of Hospital et al. (1992) that preference be
given to the selection for recombinants proximal to the allele of
interest, but that selection for recovery of the RP elsewhere in
the genome also be considered. This two-stage selection can
probably be done quite effectively ad hoc by the breeder once
the data are adequately summarized; however, Hospital et al.
(1992) suggest ways to incorporate the two criteria into a selection
index such that each component of selection is assured
appropriate weighting.
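
The linkage-drag figures quoted above can be approximated with a short calculation. The Python sketch below is our own reconstruction under a no-interference assumption, not necessarily the derivation used in the cited papers: on each chromosome arm, the distance from the selected locus to the nearest crossover accumulated over t meioses is treated as exponential with rate t per Morgan, truncated at the chromosome end, giving an expected length of (1 - exp(-t L)) / t Morgans for an arm of L Morgans. The function name and its default arguments are hypothetical.

```python
import math


def expected_donor_segment_cm(t, left_cm=100.0, right_cm=100.0):
    """Expected length (cM) of the intact donor segment around a selected
    locus in generation BC_t, for arms of the given lengths on either side."""
    if t < 1:
        raise ValueError("t must be at least 1 (the BC1 generation)")

    def arm(length_cm):
        morgans = length_cm / 100.0
        # Mean of an exponential(rate t) distance truncated at the arm end.
        return 100.0 * (1.0 - math.exp(-t * morgans)) / t

    return arm(left_cm) + arm(right_cm)


if __name__ == "__main__":
    for t in (1, 3, 7):
        print(f"BC{t}: {expected_donor_segment_cm(t):.1f} cM")
```

For a locus in the middle of a 200-cM chromosome this gives roughly 126, 63, and 29 cM for BC1, BC3, and BC7, close to the values cited in the text.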

Use of genetic markers can greatly increase the effectiveness
of backcrossing, and they should be used in any serious
backcrossing program if resources are available to the breeder.

Literature Cited 

Allard, R.W. 1960. Principles of plant breeding. Wiley, New York.
Fehr, W.R. 1987. Principles of cultivar development: v. 1. Theory and
    technique. Macmillan, New York.
Hanson, W.D. 1959. Early generation analysis of lengths of heterozygous
    chromosome segments around a locus held heterozygous with
    backcrossing or selfing. Genetics 44:843-847.
Hospital, F., C. Chevalet, and P. Mulsant. 1992. Using markers in
    gene introgression breeding programs. Genetics 132:1199-1210.
Rinke, E.H. and J.C. Sentz. 1961. Moving corn-belt germplasm
    northward. Ann. Hybrid Corn Industry Conf. 16:53-56.
Shaver, D.L. 1976. Conversions for earliness in maize inbreds. Maize
    Genet. Coop. Newsl. 50:20-23.
Young, N.D. and S.D. Tanksley. 1989. Restriction fragment length
    polymorphism maps and the concept of graphical genotypes. Theor.
    Appl. Genet. 77:95-101.






